An Efficient VAD Based on a Generalized Gaussian PDF
Abstract
Emerging applications of wireless speech communication demand increasing levels of performance in noise-adverse environments, together with speech processing systems that have a high response rate. This is a serious obstacle to meeting the demands of modern applications, and these systems therefore often need a noise reduction algorithm working in combination with a precise voice activity detector (VAD). This paper presents a new VAD for improving speech detection robustness in noisy environments and the performance of speech recognition systems. The algorithm defines an optimum likelihood ratio test (LRT) involving multiple and correlated observations (MCO). An analysis of the methodology for N = {2, 3} shows the robustness of the proposed approach through a clear reduction of the classification error as the number of observations increases. The algorithm is also compared with other VAD methods, including the G.729, AMR and AFE standards as well as recently reported algorithms, and shows a sustained advantage in speech/non-speech detection accuracy and speech recognition performance.
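To make the idea of a multiple-observation likelihood ratio test concrete, the following is a minimal sketch and not the paper's algorithm. It assumes zero-mean complex Gaussian models for noise and for speech plus noise, known variances, and statistically independent observations, whereas the abstract describes correlated observations. The function names `log_lr` and `multi_observation_decision`, the parameters, and the zero threshold are all hypothetical choices made for illustration.

```python
import math


def log_lr(power, noise_var, speech_var):
    """Log likelihood ratio of one observation with squared magnitude `power`.

    Assumed models (illustrative, not taken from the paper):
      H0 (noise only):     zero-mean complex Gaussian, variance noise_var
      H1 (speech + noise): zero-mean complex Gaussian, variance noise_var + speech_var
    """
    xi = speech_var / noise_var   # a priori SNR
    gamma = power / noise_var     # a posteriori SNR
    return gamma * xi / (1.0 + xi) - math.log1p(xi)


def multi_observation_decision(powers, noise_var, speech_var, threshold=0.0):
    """Sum the per-observation log LRs and compare the sum with a threshold.

    Summing is only valid under the independence assumption made here;
    a test for correlated observations needs the joint densities instead.
    Returns (speech_detected, total_log_lr).
    """
    total = sum(log_lr(p, noise_var, speech_var) for p in powers)
    return total > threshold, total


# Usage: two high-energy observations versus three silent ones.
speech_flag, speech_score = multi_observation_decision([4.0, 4.0], 1.0, 1.0)
silence_flag, silence_score = multi_observation_decision([0.0, 0.0, 0.0], 1.0, 1.0)
```

Under these assumptions each extra observation adds another log-ratio term, so consistent evidence moves the sum further from the threshold. This is one intuition for why the classification error can fall as the number of observations grows.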
Similar resources
A Real-Time DSP-Based System for Voice Activity Detection: Design and Implement
Most of the noise in speech communication lines can be considered Gaussian white noise. Voice activity detection (VAD) in noisy environments is an important process in many speech signal processing algorithms. Unlike other VAD algorithms, this paper proposes a simple and novel VAD algorithm based on the probability distribution function (PDF) of FFT magnitudes of both clean speech and Gau...
Speech Probability Distribution based on Generalized Gamma Distribution
In this paper, we propose a new speech probability distribution, two-sided generalized gamma distribution (GΓD) for an efficient parametric characterization of speech spectra. GΓD forms a generalized class of parametric distributions including the Gaussian, Laplacian and Gamma probability density functions (pdf’s) as special cases. All the parameters associated with the GΓD are estimated by the...
The Tail Mean-Variance Model and Extended Efficient Frontier
In portfolio theory, it is well-known that the distributions of stock returns often have non-Gaussian characteristics. Therefore, we need non-symmetric distributions for modeling and accurate analysis of actuarial data. For this purpose and optimal portfolio selection, we use the Tail Mean-Variance (TMV) model, which focuses on the rare risks but high losses and usually happens in the tail of r...
Selection of Reliable Likelihood Ratios for Statistical Model-Based Voice Activity Detection
Statistical model-based voice activity detection (VAD) is an algorithm that is robust in noisy conditions. It detects speech regions in the input signal using speech and non-speech statistical models such as the complex Gaussian probability density function (PDF). The decision rule used in this VAD is based on Bayes’ rule and considers likelihood ratios (LRs) over the whole frequency region. In this VAD, however, the Bay...
Publication date: 2007